NSF PAR Search | NSF Public Access Repository

Normalization in Attention Dynamics

Karagodin, N; Ge, S; Polyanskiy, Y; Rigollet, P (September 2025, NeurIPS)

Free, publicly-accessible full text available September 30, 2026
PropBank goes Public: Incorporation into Wikidata

Spaulding, E; Conger, K; Gershman, A; Morshed, M; Brown, S; Pustejovsky, J; Uceda-Sosa, R; Ge, S; Palmer, M (March 2024, Association for Computational Linguistics)
Henning, S; Stede, M (Ed.)
This paper presents the first integration of PropBank role information into Wikidata, in order to provide a novel resource for information extraction, one combining Wikidata's ontological metadata with PropBank's rich argument structure encoding for event classes. We discuss a technique for PropBank augmentation to existing eventive Wikidata items, as well as identification of gaps in Wikidata's coverage based on manual examination of over 11,300 PropBank rolesets. We propose five new Wikidata properties to integrate PropBank structure into Wikidata so that the annotated mappings can be added en masse. We then outline the methodology and challenges of this integration, including annotation with the combined resources.
Full Text Available
Preserve your own correlation: A noise prior for video diffusion models.

Ge S; Nah S; Liu G; Poon T; Tao A; Catanzaro B; Jacobs D; Huang JB; Liu MY; Balaji Y. (October 2023, International Conference of Computer Vision)

Full Text Available
Building a Broad Infrastructure for Uniform Meaning Representations

Bonn, J; Buchholz, M; Chun, J; Cowell, A; Croft, W; Denk, L; Ge, S; Hajic, J; Lai, K; Martin, J; et al (May 2024, ELRA and ICCL)
Calzolari, N; Kan, M; Hoste, V; Lenci, A; Sakti, S; Xue, N (Ed.)
This paper reports the first release of the UMR (Uniform Meaning Representation) data set. UMR is a graph-based meaning representation formalism consisting of a sentence-level graph and a document-level graph. The sentence-level graph represents predicate-argument structures, named entities, word senses, aspectuality of events, as well as person and number information for entities. The document-level graph represents coreferential, temporal, and modal relations that go beyond sentence boundaries. UMR is designed to capture the commonalities and variations across languages, and this is done through the use of a common set of abstract concepts, relations, and attributes as well as concrete concepts derived from words from individual languages. This UMR release includes annotations for six languages (Arapaho, Chinese, English, Kukama, Navajo, Sanapana) that vary greatly in terms of their linguistic properties and resource availability. We also describe on-going efforts to enlarge this data set and extend it to other genres and modalities. We also briefly describe the available infrastructure (UMR annotation guidelines and tools) that others can use to create similar data sets.
Full Text Available
Improving Cyberbullying Detection with User Interaction

https://doi.org/10.1145/3442381.3449828

Ge, S.; Cheng, L.; Liu, H. (April 2021, The Web Conference)
Cyberbullying, identified as intended and repeated online bullying behavior, has become increasingly prevalent in the past few decades. Despite the significant progress made thus far, the focus of most existing work on cyberbullying detection lies in the independent content analysis of different comments within a social media session. We argue that such leading notions of analysis suffer from three key limitations: they overlook the temporal correlations among different comments; they only consider the content within a single comment rather than the topic coherence across comments; they remain generic and exploit limited interactions between social media users. In this work, we observe that user comments in the same session may be inherently related, e.g., discussing similar topics, and their interaction may evolve over time. We also show that modeling such topic coherence and temporal interaction is critical to capturing the repetitive characteristics of bullying behavior, thus leading to better predictive performance. To achieve this goal, we first construct a unified temporal graph for each social media session. Drawing on recent advances in graph neural networks, we then propose a principled graph-based approach for modeling the temporal dynamics and topic coherence throughout user interactions. We empirically evaluate the effectiveness of our approach with the tasks of session-level bullying detection and comment-level case study. Our code is released to the public.
Full Text Available